Conditional expectation estimation through attributable components

نویسنده

  • ESTEBAN G. TABAK
چکیده

A general methodology is proposed for the explanation of variability in a quantity of interest x in terms of covariates z = (z1, . . . ,zL). It provides the conditional mean x̄(z) as a sum of components, where each component is represented as a product of non-parametric one-dimensional functions of each covariate zl that are computed through an alternating projection procedure. Both x and the zl can be real or categorical variables; in addition, some or all values of each zl can be unknown, providing a general framework for multi-clustering, classification and covariate imputation in the presence of confounding factors. The procedure can be considered as a preconditioning step for the more general determination of the full conditional distribution ρ(x|z) through a data-driven optimal-transport barycenter problem. In particular, just iterating the procedure once yields the second order structure (i.e. the covariance) of ρ(x|z). The methodology is illustrated though examples that include the explanation of variability of ground temperature across the continental United States and the prediction of book preference among potential readers.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

CONDITIONAL EXPECTATION IN THE KOPKA'S D-POSETS

The notion of a $D$-poset was introduced in a connection withquantum mechanical models. In this paper, we introduce theconditional expectation of  random variables on theK^{o}pka's $D$-Poset and prove the basic properties ofconditional expectation on this  structure.

متن کامل

Estimation of Generalized Multisensor

| This paper attacks the problem of generalized multisensor mixture estimation. A distribution mixture is said to be generalized when the exact nature of components is not known, but each of them belongs to a nite known set of families of distributions. Estimating such a mixture entails a supplementary diiculty: one must label, for each class and each sensor, the exact nature of the correspondi...

متن کامل

A Kernel Approach to Estimating the Density of a Conditional Expectation

Given uncertainty in the input model and parameters of a simulation study, the goal of the simulation study often becomes the estimation of a conditional expectation. The conditional expectation is expected performance conditional on the selected model and parameters. The distribution of this conditional expectation describes precisely, and concisely, the impact of input uncertainty on performa...

متن کامل

Estimation of Generalized Multisensor Hidden Markov Chains and Unsupervised Image Segmentation

This paper attacks the problem of generalized multisensor mixture estimation. A distribution mixture is said to be generalized when the exact nature of components is not known, but each of them belongs to a finite known set of families of distributions. Estimating such a mixture entails a supplementary difficulty: One must label, for each class and each sensor, the exact nature of the correspon...

متن کامل

Class Conditional Density Estimation Using Mixtures with Constrained Component Sharing

We propose a generative mixture model classifier that allows for the class conditional densities to be represented by mixtures having certain subsets of their components shared or common among classes. We argue that, when the total number of mixture components is kept fixed, the most efficient classification model is obtained by appropriately determining the sharing of components among class co...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2017